Skip to content

Conversation

@Zouxxyy
Copy link
Contributor

@Zouxxyy Zouxxyy commented Jul 10, 2023

Change Logs

The following is the current configuration of bootstrap's mode selection:

  • hoodie.bootstrap.mode.selector
  • hoodie.bootstrap.mode.selector.regex
  • hoodie.bootstrap.mode.selector.regex.mode

Fix #6673 (comment), The correct use of hoodie.bootstrap.mode.selector is:

  • set hoodie.bootstrap.mode.selector as BootstrapRegexModeSelector, specify partitions by hoodie.bootstrap.mode.selector.regex, then set hoodie.bootstrap.mode.selector.regex.mode for the specify partitions

  • set hoodie.bootstrap.mode.selector as MetadataOnlyBootstrapModeSelector or FullRecordBootstrapModeSelector

Impact

Correct the use of hoodie.bootstrap.mode.selector

Risk level (write none, low medium or high below)

low

Documentation Update

none

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@Zouxxyy
Copy link
Contributor Author

Zouxxyy commented Jul 11, 2023

CI failure is caused by other, Can you help with a review? @codope

@Zouxxyy
Copy link
Contributor Author

Zouxxyy commented Jul 22, 2023

@danny0405 @codope Can someone help with a review? It's a bug fix, before it the usage and test cases of hoodie.bootstrap.mode.selector are wrong. And I've already resolved one conflict...

Copy link
Member

@codope codope left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Zouxxyy Can you tell me what scneario is fixed by this PR?
As per my understanding, hoodie.bootstrap.mode.selector can take one of three classnames:

  1. BootstrapRegexModeSelector
  2. MetadataOnlyBootstrapModeSelector
  3. FullRecordBootstrapModeSelector
    If that config is set to BootstrapRegexModeSelector then hoodie.bootstrap.mode.selector.regex is used (default value .* means all partitions).
    And then there is a config hoodie.bootstrap.mode.selector.regex.mode which can be either FULL_RECORD or METADATA_ONLY. This config cannot be set to METADATA_ONLY if we are using FullRecordBootstrapModeSelector. The current logic seems to cover different scenarios. Are we missing a corner case?

@Zouxxyy
Copy link
Contributor Author

Zouxxyy commented Jul 23, 2023

@codope

And then there is a config hoodie.bootstrap.mode.selector.regex.mode which can be either FULL_RECORD or METADATA_ONLY. This config cannot be set to METADATA_ONLY if we are using FullRecordBootstrapModeSelector.

see BootstrapRegexModeSelector
image

hoodie.bootstrap.mode.selector.regex.mode is only used with BootstrapRegexModeSelector, It has nothing to do with MetadataOnlyBootstrapModeSelector or FullRecordBootstrapModeSelector

@apache apache deleted a comment from hudi-bot Jul 23, 2023
@hudi-bot
Copy link
Collaborator

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants